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Abstract 



In this paper we consider the following modification of the iterative search problem. We are 

X^y • given a tree T, so that a dynamic catalog C{v) is associated with every tree node v. For any 

X and for any node-to-root path n in T, we must find the predecessor of x in Ut,g^C(u). We 

present a linear space dynamic data structure that supports such queries in 0{t{n) -\- \'k\) time, 

Q \ where t{n) is the time needed to search in one catalog and |7r| denotes the number of nodes on 

path TT. 

We also consider the reporting variant of this problem, in which for any xi, X2 and for 
^ ■ any path tt', all elements of U^gjr' (C'(i') H [a;i,a:2]) must be reported; here tt' denotes a path 

10 \ between an arbitrary node uq and its ancestor vi. We show that such queries can be answered 

in 0(t(n) + Itt'I -|- fc) time, where k is the number of elements in the answer. 

To illustrate applications of our technique, we describe the first dynamic data structures 
for the stabbing-max problem, the horizontal point location problem, and the orthogonal line- 
[_~ I segment intersection problem with optimal 0(logn/loglogri) query time and poly- logarithmic 

^^ ■ update time. 
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1 Introduction 

The situation when we must search for the position of a vakie x in many ordered sets frequently 
arises in data structures and computational geometry problems. The brute force approach of 
searching for x in every set "from scratch" can be improved if there are restrictions on the order in 
which the sets can be searched. Such improvements for some important problems were suggested 
by several researchers, see e.g., [MIES]. Chazelle and Guibas described in their seminal paper [11] a 
general data structuring technique, called fractional cascading, that addresses the general problem 
of searching in multiple sets. The fractional cascading technique solves the following iterative search 
problem: We are given a graph G, called the catalog graph, so that an ordered set C{v) C U, called 
a catalog, is associated with every graph node. A query consists of an element x £ U and a 
subgraph G' of G. The goal is to find the predecessoio of x in each catalog G{v) for v G G'. In this 
paper we consider the following modification of the iterative search, further called multiple catalog 
searching problem: the graph G is a rooted tree, the subgraph G' is a node-to-root path vr, and 
we must search in the union of all catalogs C{v), v £ tt. We also consider the reporting variant, 
further called multiple catalog reporting, in which all elements e G C{v), v G tt, that belong to the 
query range [xi,2;2] must be reported. 

Although the problems addressed in this paper are more restrictive than iterative searching, 
they can be applied in many situations in which iterative searching is traditionally used. We show 
that multiple catalog searching and reporting queries can be answered by spending constant time 
in each node f of vr if vr is sufficiently large (ignoring the time to output all elements in the answer) . 
This enables us to obtain for the first time dynamic data structures with optimal query time and 
poly-logarithmic update time for point location in a set of horizontal segments, stabbing-max, and 
orthogonal line-segment intersection reporting. 

Previous and Related Work. Chazelle and Guibas [11] showed that it is possible to identify 
the predecessor of x in C{v) for each catalog C{v), v G G', in 0{t{n) + \G'\) time, where n denotes 
the total number of elements in all catalogs, \G'\ is the number of nodes in G' , and t{n) is the time 
needed to search in one catalog. The dynamic version of the fractional cascading is considered by 
Mehlhorn and Naher [23]; in [23] the authors described how to support insertions into and deletions 
from a catalog C{v) in O(loglogn) time if a pointer to the deleted element x or the predecessor of 
an inserted element x is given; the data structure of [23] supports queries in 0{t{n) + \G'\ log log n) 
time, i.e., the search takes O(loglogn) time in each node of G' . Imai and Asano [19] considered the 
semi-dynamic scenario, when new elements can be inserted but deletions are not supported. The 
result of [19] can be used to support insertions in 0(log* n) time and search in 0{t{n) + \G'\ log* n) 
time in the pointer machine model [31]; another result of [19] can be used to support insertions 
in 0(1) time and search in 0{t{n) + |G'|) time in the RAM model. Since [ T9l I23j . the dynamic 
fractional cascading was applied to a number of data structure problems, e.g., point location, range 
reporting, and segment intersection. The technique was also extended e.g., to support iterative 
search in graphs with super-constant local degree [30] and to the case when elements stored in 
different catalogs belong to different ordered sets, e.g. [3 [5]. However, there is no currently known 
dynamic data structure that supports iterative search in o(log log n) time per catalog (ignoring the 
0{t{n)) term). Since fractional cascading relies on the union-split-find queries, and union-split-find 
queries cannot be answered in o(loglogn) time [21], it appears that we must spend r2(loglogn) 
time in each node to solve the iterative searching problem. 



^The predecessor of x in S, denoted pred(a;, S), is the largest e £ S such that e < x; the successor of x in S, 
denoted succ(a::, 5"), is the smallest e £ S such that e > x. 



Our Results. The fractional cascading |llj technique and its variants for the dynamic and 
semi-dynamic scenarios [231 [T9] can be apphed when the catalog graph is an arbitrary graph with 
locally bounded degree (e.g., any graph with bounded degree; see [llj for precise definition). In our 
scenario the catalog graph is a rooted tree and all catalogs C{v) for all nodes v on the path vr must 
be searched. Moreover, instead of searching for x in each catalog, we search in all catalogs. That is, 
the query consists of a value x and a path vr between a node u and the root of the tree; the answer 
to the query is the predecessor p^ of x in the union of all catalogs on vr, px = pred(x,U„g7rC'(t')). 
Henceforth, such queries will be called multiple catalog searching queries. We obtain the following 
results with a linear space data structure: 

1. Multiple catalog searching queries can be answered in 0{t{n) + (l/e)|7r|) time, and updates are 
supported in O(log^n) time for any e > 0. 

2. Multiple catalog searching queries can be answered in 0(t(n) + |7r| logloglogn) time, and updates 
are supported in O(loglogn) time. 

Other trade-offs between query and update times are described in Theorem [TJ We assume that 
a pointer to the position of an inserted or deleted element in the data structure is known for the 
update operation. 

We also consider the multiple catalog reporting problem. A query consists of values xi,2;2 G U 
and a path tt from a node vq to a node vi, such that vi is the ancestor of vq. The answer to the 
query consists of all elements e G Uy£-,rC{v), such that xi < e < X2- 

1. Multiple catalog reporting queries can be answered in 0{t{n) + (l/e)|7r| + k) time, where k is 
the number of elements in the answer, and updates are supported in 0(log^ n) time for any e > 0. 

2. Multiple catalog reporting queries can be answered in 0{t{n) + |7r| log log log n + k) time, where 
k is the number of elements in the answer, and updates are supported in O(loglogn) time. 
Again, the space usage of our data structure is linear in the total number of elements in all catalogs. 
Other trade-offs between query and update times are described in Theorem [2l Dynamic range 
reporting in a single catalog was considered in [271 ES]. The data structure of [27l [26] supports 
queries and updates in 0(logloglogC/) and O (log log C/) time respectively, where U is the size of 
the universe. Another variant of their data structure supports queries in 0(1) time and updates in 
0(log^ U) time. Besides that, the data structure described in [27] uses randomization and relies on 
a more extensive set of basic arithmetic operations. 

Finally, we consider the multiple catalog maxima problem. A query consists of a path tt from 
a node vq to a node vi, such that vi is the ancestor of vq; we must output the maximal element in 
every catalog C{v), v ^ tt. For a tree with node degree 0(log ' n) such queries can be answered 
in 0(|7r|) time. Insertions and deletions are supported in O(loglogn) and 0((loglogn)^) time 
respectively. Moreover, in this case we extend the definition of update operations, so that an 
element can be simultaneously inserted into (deleted from) any catalogs C{vf), . . . ,C{vi) where 
Vf, ■ ■ ■ ,vi are sibling nodes. This result, described in section HI is obtained with a different, simpler 
technique. 

Applications. As an illustration of our technique, we present dynamic data structures for several 
problems that for the first time achieve 0(logn/loglogn) query time in the word RAM model. 
The marked ancestor problem [3] can be reduced to each of the problems described below, see [3]. 
In [3], the authors also show that any data structure with poly-logarithmic update time and poly- 
logarithmic word size needs ^(logn/loglogn) time to answer the marked ancestor problem. Hence, 
we obtain data structures with optimal query time for all considered problems. 
Horizontal Point Location. In the horizontal point location problem aka vertical ray shooting 



problem, a set of n horizontal segments is stored in the data structure, so that for a query point q 
the segment immediately below (or immediately above) q can be reported. Giyora and Kaplan [18j 
describe a linear space RAM data structure with O(logn) query and update times in the RAM 
model. We refer to [18] for a detailed description of previous results. Although the O(logn) time 
is optimal if we can manipulate segments by comparing their coordinates, the query time can be 
improved in the word RAM model. In this paper we present a data structure that supports queries 
in 0(log n/ log log n) time and updates in 0(log^^^ n) amortized time; our data structure uses 0{n) 
space. As explained above, this query time is optimal. 

Retroactive Searching. In the retroactive searching problem, introduced by Demaine et.al. |14] . 
the data structure maintains a sequence of keys. Each key can be inserted at time tj and deleted 
at time to > tj. The answer to a query {q, t) is the element that precedes q at time t. It was shown 
in |18| that retroactive searching is equivalent to the horizontal point location problem. Thus our 
result for horizontal point location demonstrates that retroactive searching queries can be answered 
in 0(logn/loglogn) time in the word RAM model. 

Stabbing-Max Data Structure. In the stabbing-max problem, we maintain a set of axis-parallel 
d-dimensional rectangles, and each rectangle s has priority ps- Given a query point q, the stabbing- 
max data structure finds a rectangle with maximum priority that contains q. The one-dimensional 
data structure of Kaplan, Molad, and Tarjan [21] supports queries and insertions in O(logn) time, 
deletions in O(lognloglogn) time, and uses 0{n) space. The data structure of Agarwal, Arge, and 
Yi [1] also uses linear space and supports queries and updates in O(logn) time. See |2T1[T1|33] for 
a more extensive description of previous results. 

In this paper we describe two data structures that support one-dimensional stabbing-max 
queries in optimal 0(logn/loglogn) time. The first data structure uses 0(n log n/ log log n) space 
and supports insertions and deletions in O(logn) and O(lognloglogn) time respectively. The sec- 
ond data structure uses 0{n) space but supports updates in 0(log^+^n) time. 
Orthogonal Line-Segment Intersection. In this problem a set of horizontal segments is stored 
in a data structure, so that for a vertical query segment s„ all segments that intersect with s^ can 
be reported. The data structure of Cheng and Janardan [12] supports such queries in 0(log n + k), 
where k is the number of segments in the answer. Mehlhorn and Naher reduced the query time to 
0(lognloglogn + A;) using dynamic fractional cascading. The fastest previously known data struc- 
ture of Mortensen |25] supports queries and updates in 0(logn + A:) and O(logn) time respectively 
and uses 0(n log n/ log log n) space. In this paper we present a 0(n log n/ log log n) space data 
structure that answers queries in 0(logn/loglogn + k) time and supports updates in 0(log ^^ n) 
time. 

All results presented in this paper are valid in the word RAM computation model. We assume 
that every element (resp. every point) fits into one machine word and that additions, subtractions, 
and bit operations can be performed in constant time. We also assume that the most significant 
bit (MSB) of an integer can be found in 0(1) time. It is possible to find MSB in 0(1) time using 
AC^ operations [4]. Throughout this paper e denotes an arbitrarily small positive constant. 

2 Main Idea 

In this section we sketch the main ideas of our approach. We start by showing how the fractional 
cascading technique can be used to solve the multiple catalog searching problem. Then, we describe 
the difference between the fractional cascading and our approach. 



We construcio augmented catalogs AC{v) 5 C{v) for all nodes v of T starting at the root. For 
the root vr, AC{vr) = C^vr). If AC{u) for a node u is already constructed, then AC{uj) for a 
child Uj of u consists of some elements from AC{u) and all elements of C{uj). Elements of C{u) and 
AC{u) \ C{u) are called proper elements and improper elements respectively. For every improper 
element e € AC{u), there is a copy e' of e that is stored in j4C(parent(ti)). Elements e and e' are 
provided with pointers to each other and are called a bridge. 

We want to organize the search procedure in such way that only a small number of ele- 
ments in every visited node must be examined. Using fractional cascading, we can guarantee 
that there are 0{d) elements of AC{w) between any two improper elements of AC{v), where v is 
any node of T except of the root and w is the parent of v. Each element stored in the augmented 
catalog AC{v) belongs either to C{v) or to a catalog C{w) for some ancestor w of v. Hence, 
U^g7r^C(u) = U^,g7rC'('y) for any node-to-root path tt = vo,vi, . . . ^vr. Therefore multiple catalog 
searching (unlike general iterative searching) is equivalent to finding the predecessor in yj,ije-KAC{v). 
This suggests the following method for searching in [Jy^T^C{v): The search procedure starts by 
identifying p{vq) = pred(x,^C(t'o)) and s{vq) = smcc{x , AC {vq)) . For every node Vi, i > 0, we 
find p{vi) = pred(a;, AC{vq) U . . . U AC{vi)) and s{vi) = succ(x, AC{vq) U . . . U AC{vi)). Suppose 
that p{v,i) and s{vi) for some z > are known. To identify p{vi+i) and s{vi+i), we only need to 
examine elements of AC{viJ^i) that belong to the interval \p{vi),s{vi)]. Since there is no element 
e G AC{vi) between p{vi) and s{vi), there are 0{d) elements of AC{vi+i) between p{vi) and s(vj), 
where d is the maximal node degree of T. For d = log^ ' n, we can search in a set of 0{d) elements 
in 0(1) time [16]. 

The only issue is how to quickly find elements of AC{viJ^i) that are between p{vi) and s{vi). Let 
hi{vi) be the bridge that precedes p{vi) and let 62(^1) be the bridge that follows s{vi); there are 0{d) 
elements between hi{vi) and 62(^4) in AC{vi+i). Bridges hi{vi) and h2{vi) can be found by storing 
proper and improper elements of AC{vi) as proper and auxiliary elements in a union-split-find data 
structure. Unfortunately, we would need il(loglogn) time to identify 61 (fj) because of the lower 
bound of [23]. 

Our solution does not rely on bridges and union-split-find data structures during the search 
procedure. Instead, we construct additional catalogs AC{u) in each node u. Catalogs AC{v) are 
constructed in a leaf-to-root order: for a leaf node u;, AC{ui) = AC{ui); for an internal node w, 
AC{u) contains all elements of AC{u) and some elements from catalogs AC{uj)^ where Uj are the 
children of u. We guarantee that at least one element from a sequence of log^ ' n consecutive 
elements in AC{u) also belongs to AC (parent (n)) for any node u. This allows us to identify the 
elements hi{vi) and 62(^4) that precede p{vi) and follow s{v,j) in AC{vi) n AC{yi+i)) in 0(1) time. 
Hence, we can quickly navigate from a node to its parent. On the other hand, a catalog AC{vi), 
i > 1, can contain a large number of elements that are not relevant for the search procedure, i.e., 
elements from some catalogs C{w), such that u; is a descendant of Vi and w ^ tt. Hence, there can 
be an arbitrarily large number of elements in AC{vi) n [61 (fj), b2{vi)]. However, we can show that 
the number of elements in AC{vi) n [bi{vi), b2{vi)] is bounded. 

We store a data structure R{v) in every node v of T. For any two elements ei E AC{v) and 
62 G AC{v), the data structure R{v) identifies an element 63 G AC{v) such that ei < 63 < 62, 
or determines that such 63 does not exist. The data structure R{v) combines the approach of the 
dynamic range reporting data structure [271 ES] and the labeling technique [201 ES]; details can 



^ We describe a simplified version of the fractional cascading technique because we are only interested in searching 
catalogs that lie on a node-to-root path. 
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Figure 1: An example of blocks in catalogs AC{v) and AC(parent(t')). Elements AC{v) \ C{v) 
and elements of AC{v) are depicted with black circles and light blue circles respectively. Proper 
elements of AC{v) are shown with green circles and all other elements of AC{v) are shown with 
white circles. 



be found in section [3l Using R{v), we can identify relevant elements in every visited node on the 
search path vr. 

3 Multiple Catalog Searching 

Overview. In this section we give a more detailed description of the approach sketched in section[2j 
Every node v contains a catalog C{v) and an augmented catalog AC{v) D C{v). For the root vr, 
C{vr) = AC{vji). The augmented catalog for a non-root node v contains some elements from 
catalogs AC{w) for ancestors w of v, so that the following statement is true: 

Property 1 Let Ui be a child of an internal node u. Suppose that for two elements ei G AC{u) 
and 62 € AC{u) there is no 63 G AC{u) n AC(parent(n)) such that ei < 63 < 62. Then there are 
0{d) elements e E j4C(parent(ti)), such that ei < e < 62- 

Here and further d denotes the maximal node degree of T. Using the standard fractional cascading 
technique [23tl30j. we can construct and maintain catalogs AC{v) that satisfy Property [H 

Each node v also contains a catalog AC{v) D AC{v). Each catalog AC{v) is subdivided into 
blocks Bi, so that (1) any element in a block Bi is smaller than any element in a block Bj for 
i < J, and (2) each block, except of the last one, contains more than log n/2 and less than 
2 log n elements; the last block contains at most 2 log n elements. The set AC{v) consists of first 
elements from each block Bi of AC{v). For a leaf node vi, AC{vi) = AC{vi). For an internal 
node n, AC{u) = AC{u) U {UiAC{ui)), where the union is taken over all children m of u. Thus 
AC'(f) n]4C'(parent(?;)) = AC{v) U {AC{v) \ C{v)). Each element of AC{v) and each element in 
AC{v) \ C{v) contains a pointer to its copy in ylC(parent(7;)) called the up-pointer. We denote by 
UP{v) the set of all elements in AC{v) that have up-pointers, i.e., UP{v) = AC {v)r\AC {'pa.-reTit{v)) . 
Thus UP{v) consists of first elements in every block of AC{v) and improper elements from AC{v). 
See Fig. [TJ By a slight misuse of notation, we will sometimes denote the elements of UP{v) as 
up-pointers. For each block Bi of every catalog AC{v), we store a data structure Bi that enables 
us to find for any e £ Bi the largest element / G {Bi n UP{v)) such that / < e. Such queries can 
be supported in 0(1) time because a block contains a poly-logarithmic number of elements. 



We store in each node v a data structure R{v) that enables us to find an element e G AC{v) 
between any two elements of AC{v), or determine that such e does not exist. The data structure 
The data structure R{v) and the following Property play a key role in our construction 

Property 2 Let bi and 62 be two elements of UP{v) such that there is no element f G AC{v) n 
UP{v) with bi < f < 62. Then the catalog AC {parent (v)) contains 0{d) elements e, such that 

61 < e < 62- 

Proof: Property [2] is a straightforward corollary of Property [1] Let ei = pred{bi,AC{v)) and 

62 = svLcc{b2,AC{v)). Since {AC (v) Ci AC {par ent{v))) C UP{v), {AC {v) D AC {par ent{v))) C 
{AC{v) n UP{v)). Hence, there is no element of AC{v) D ^C(pareiit(f)) between ei and 62- 
Therefore, by Property [H j4C(parent(t')) contains 0{d) elements e, such that ei < e < 62- Since 
ei < bi < b2 < e2, Property [2] is true. D 

We observe that Property [2] only bounds the number of elements in AC{parent{v)) D [bi, 62]- The 
number of elements in AC {parent {v)) n [61, 62] can be arbitrarily large. 

In the next part of this section we show how multiple catalog searching queries can be answered 
if Property [2] is satisfied. Then, we describe the data structure for a block and the data structure 
R{v). Finally, we describe the update procedure and sketch the analysis of the space usage and the 
update time. 

Search Procedure. Let vq be a node of T and let vr be the path from vq to the root vr of 
T. We will describe the procedure that identifies both the predecessor and the successor of x in 
U„(=7rC(t;). Li every node w G vr we identify elements p{v) = pred(x,Py) and s{v) = s\icc{x,Py), 
where P^ = UueTTvAC{u) and ir^ is the path from vq to v. Clearly, we can find p{vq) and s{vq) in 
time 0{t{n)). Let 6i(t'o) and 62(^0) be the up-pointers that precede and follow p{vo) and s{vo). 
Since each block contains at least one up-pointer, we can find 61 (fo) and 62(^0) ii^ 0{1) time. 

Suppose that we know p{vi), s{vi), bi{vi), and 62(^4) for some node Vi £ vr, so that bi{vi) < 
X < 62(^4) and bi{vi), 62(^2) are up-pointers that satisfy the condition of Property [2l We can 
find p(fj-|_i), s{vi^i), bi{vi+i), and 62(^1+1) for the parent Uj+i of Vi as follows. By Property O 
there are at most 0{d) elements of AC{vi^i) between bi{vi) and 62(^1). Since bi{vi) < p{vi) < 
s{vi) < b2{vi), elements of AC{vi^i) that do not belong to the interval [bi{vi),b2{vi)] are not 
relevant for our search. If AC{vi) n [bi{vi),b2{vi)] ^ 0, we can identify some e G AC{vi^i), 
bi{vi) < e < b2{vi), using the data structure i?(fj+i). We will show in the next paragraph how 
all elements in AC{vi+i) n [61 (fj), 62(^1)] oan be examined and compared with x, p{vi), and s{vi) 
in 0(1) time. Hence, we can identify p{vi+i) and s{vi+i) in time 0(1). The up-pointers bi{vi+i) 
and b2{vi+i) are the up-pointers that precede p{vi+i) in AC{vi+i) and follow s{vi-^.l) in AC{vi^i) 
respectively. Otherwise, if AC{vi) n [bi{v),b2{v)] = 0, p{vi+i) = p{vi) and s{viJ^i) = s{vi). In this 
case the up-pointers bi{vi+i) and b2{vi^i) are the up-pointers that precede bi{vi) and follow 62(^4) 
respectively. Since every element in AC{v) belongs either to C{v) or to C{w) for some ancestor w 
of V, U^^TT AC {v) = U^gvrC(f). Hence if we know p^,^ and s^,^ for the root node vr, we also know 
pred{x,L)y(ZT,C{v)) = Pv^ and succ(x,U^e^C(f)) = s„^. 

It remains to show how we can find p{vi+i) and s(wj+i) if AC{vi) n [bi{vi),b2{vi)] 7^ and a 
pointer to some element e G AC{vi+i), bi{vi) < e < b2{vi), is given. Suppose that the maximal 
node degree d = 0{log^ n) for a constant g. We divide AC{v) for each v £ T into groups Cj so that 
each group contains at least log^ n and at most 4 log^ n elements and store the elements of each 
group in the atomic heap Qi of Fredman and Willard ^jSJ, so that predecessor queries and updates 
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Figure 2: Searching for the predecessor and the successor of a: in a node I'j+i. Elements AC{v)\C{v)) 
and elements of AC{v) are depicted with black circles and light blue circles respectively. Proper 
elements of AC{v) are shown with green circles and all other elements of AC(v) are depicted with 
white circles. Only relevant up-pointers are shown. 



are supported in 0(1) time [HI [32]. There are 0(1) groups Gj, such that Gj n [6i(uj), ^2(^1)] / ^■ 
Hence, we can find the largest index /, such that the first element in Gf is smaller than x in 0(1) 
time. Using Qf, we find the predecessor pf of x in Gf. If p/ is larger than p{vi), we set p{vi-^-i) = pf] 
otherwise p{vi+i) = p{vi). Hence, we can find p(wj+i) in 0(1) time. We can find s(fi+i) with the 
symmetric procedure. 

Thus our search procedure answers one query to a data structure R{v) in every node f G vr. All 
other operations take 0(1) time per node. 

Block Data Structure. Our data structure uses the fact that a block contains 0(log n) elements. 
Hence, each element of a block can be specified with O(loglogn) bits and information about 
Q{^J\ogn) elements can be packed into one machine word. We can use this fact to store information 
about all elements of a block in a tree with node degree 0(\/logn). Details are given below. 

We associate a unique stamp t{e) < 4 log n with each element e in B. The array A contains 
entries for all elements of B so that A[k] = e for t(e) = k. We rebuild A after 21og n update 
operations and assign an arbitrary stamp t{e) < \B\ to each e £ B. When a new element e' is 
inserted into B, we set t(e') = k' , where k' is the number of update operations since the last rebuild. 

We also store a B-tree Tg with node degree Q{y/\ogn) augmented as follows. Let S{'Wi) be 
the set of elements stored in a leaf wi. The word L{wi) contains the time-stamps and ranks of all 
elements in S{wi). We also associate a word (i.e., a sequence of O(logn) bits) M^ with each node 
w of T. The i-th bit in M{wi) for a leaf wi equals to 1 if the i-th. element of S{wi) belongs to 
UP{v). The i-th bit in M{w) for an internal node w equals to 1 if and only if at least one bit in 
M{wi) equals to 1, where Wi is the i-th child of w. For each word M{w) and for any i, we can find 
the largest j < i, such that the j-th bit of M{w) is set to 1. Using a look-up table of size o{n), 
common for all blocks, we can answer such queries in 0(1) time. For each element e, we store a 
pointer to the leaf wi of Tb^, such that e belongs to S{wi). 

Given an element e, we identify the leaf wi in which it is stored. Using L{wi) we identify the 
rank j of e in S{wi). This can be done in 0(1) time with standard bit operations. If there is at 
least one bit set to 1 among the first j — 1 bits of M{wi), we use L{wi) to identify the stamp of 



the element e' that corresponds to the A;-th bit in M{wi), where k is the index of the rightmost 
bit set to 1 among the first j — 1 bits of M{wi). Then, we find the element e' using the array A. 
Otherwise, we search for the rightmost leaf Ws, such that Wg is to the left of wi and S{ws) contains 
at least one bit set to 1. Since the height of Tb is 0(1), we can find Ws in 0(1) time. Then, we use 
L{ws) and A to identify the element corresponding to the rightmost bit set to 1 in Wg- 

When a new element is inserted, we insert an entry into the array A. Then, we identify the leaf 
wi in which e is stored and update the word L{wi). We also update the word M{wi) and the words 
M(wj) for all ancestors Wj of wi. The B-tree can be re-balanced in a standard way. Deletions are 
performed symmetrically. 

Data Structure R{v). Essentially, our data structure is based on the combination of the range 
reporting data structure of Mortensen, Pagh, and Patrascu [27] and the dynamic labeling scheme 
of [20, 36j. Using the method of [36], we can assign a positive integer label bounded by 0(1^0(^)1/ log^ n) 
to each block of AC{v), so that labels can be inserted and deleted in 0(log n) time. If a block 
contains at least one element from AC{v), then we store the label of this block in a data struc- 
ture i?" that supports one-dimensional range reporting queries. Using the result of [27], the data 
structure R^ supports queries in qr{n) time and updates in time Ur{n), where qr{n) and Ur{n) are 
arbitrary functions satisfying Ur{n) > loglogn, qr{n) < logloglogn, and 2'^'^^"'' = ©(log^^j-^) logn). 
For instance, queries and updates can be supported in 0(1) time and O(log'^n) time respectively. 
Alternatively, R^ can support queries in 0(log loglogn) time and updates in O(loglogn) time. Al- 
though the data structure [27] uses randomization and the update time is expected, we can obtain 
the data structure with the same deterministic worst-case update time by replacing all Bloomier 
filters with bit vectors. The space usage of this modified data structure is 0(|j4C(f)|/log^n). 

We can determine, whether there is an element e G AC{v) between two elements ei and 62 that 
belong to the same block Bi, using a data structure that is similar to the block data structure Bi. 

If ei and 62 belong to different blocks Bi and B2, we can determine whether there is an element 
e' such that e' G AC{v) D Bi and e' is larger than ei or e' G AC{v) n B2 and e' is smaller than 62 
as explained in the previous paragraph. If such e' does not exist, we check whether there is a block 
between Bi and B2 that contains at least one element of AC{v) using the data structure R^. If 
such a block B3 is found, we identify an element e' G AC{v) D B3. 

Space Usage and Updates. It was shown in [23] that all catalogs AC{v) contain 0{n) elements 
and an update on a catalog C{v) incurs 0(1) amortized updates of catalogs AC{u). An element 
e can be inserted into or deleted from a catalog AC{u) in O(loglogn) time if the position of (the 
predecessor of) the element e in AC{u) is known: see e.g., [23] . Applying the method of [23] to 
catalogs AC{v), we can show that all catalogs AC{v) also contain 0{n) elements, and an update 
of a catalog AC{v) incurs 0(1) updates of AC{w) for some nodes w. 

When a new element e is inserted into AC{v), we update the data structure for the block B 
that contains e; we also update the data structure R^ if e G AC{v). If the number of elements 
in a block equals to 21og n, we split the block into two blocks, so that each block contains log n 
elements, insert a new label for one of the newly created blocks, and update the data structure R^. 
When a new label is inserted, 0(log n) other labels may be changed. Hence, we must perform 
0(log n) updates of the data structure i?". Since a new label is inserted after 0(log n) insertions, 
the amortized cost of an insertion into R{v) is 0{ur). If e also belongs to AC{v) and e is the only 
element in e G AC{v) n B, then e must be inserted into a data structure Qj for some group Gj. If 
the number of elements in Gj equals to 4 log^ n, we split the group into two groups of equal size. 
Thus the amortized cost of an insertion into Gj is 0(1). Deletions are performed symmetrically. 



Since the update time is dominated by an update of the data structure K^, the total cost of an 
update operation is 0{ur{n)). 

During the search procedure, we must answer one query to a data structure R{v) in every node 
V £ tt; ah other operations can be performed in 0(1) time. Hence, a query can be answered in 
0{t{n) + \TT\qr{n)) time. The result of this section is summed up in the following Theorem. 

Theorem 1 We are given a tree T with maximal node degree d = log ' ' n, so that a catalog 
C{y) C U is associated with each node v, XIi^gt 1^(^)1 ~ "■• ^^^ Ur{n) and qr{n) he arbitrary 
functions satisfying Ur{n) > log log n, qr{n) < log log log n, and 2'^"'^'^' = 0(log„^(„) logn). There 
exists a data structure that answers multiple catalog searching queries in 0{t{n) + |vr|gr('T')) time, 
where t[n) denotes the time needed to search in one catalog of n elements. If a pointer to the 
(predecessor of) x in AC{v) is given, then x can he inserted or deleted in 0(loglogn + Ur{n)) 
amortized time. 

Two interesting choices oiur{n) and qr{n) are Ur{n) = log^n, qr{n) = const and Ur{n) = loglogn, 
qr{n) = log loglogn. Thus we can answer multiple catalog searching queries in 0{t{n) + |7r|) time 
and support updates in 0(log^ n) amortized time. We can also answer multiple catalog searching 
queries in 0{t{n) + |7r| log loglogn) time and support updates in O(loglogn) amortized time. 

3.1 Multiple Catalog Reporting 

In this subsection we describe how our data structure can be modified to report elements in the 
query interval [x;, x/^] for all catalogs C{v), where f is a node on a path vr. In this case tt is a path 
from a node vq to a node vi such that vi is the ancestor of vq. We observe that, unlike in the 
multiple catalog searching problem, vi is not necessarily the root of T. 

Theorem 2 We are given a tree T with maximal node degree d = log ' ^ n, so that a catalog 
C{v) C U is associated with each node v, '}2vgt\^('^)\ ~ "" ^^^ Ur{n) and qr{n) he arbitrary 
functions satisfying Ur{n) > loglogn, qr{n) < log loglogn, and 2'^^^"-' = 0{log^^/^^\logn). There 
exists a data structure that answers multiple catalog reporting queries in 0{t{n) + \TT\qr{n) + k) time, 
where t{n) denotes the time needed to search in one catalog of n elements and k is the number of 
points in the answer. If a pointer to the (predecessor of) x in AC{v) is given, then x can be inserted 
or deleted in O (loglogn + Ur{n)) amortized time. 

We maintain the catalog AC{v), the catalog AC{v), and the data structure R{v) in every node 
v S T as described in section [3l Moreover, every node v contains a data structure Rc{v): for any 
two elements ei and 62 in AC{v), Rc{v) identifies an element e' £ C{v), e\ < e' < 62, if such e' 
exists. Rc{v) is implemented in the same way as R{v). In every node v on the path tt, we identify 
Pi{v) G AC{v) and si{v) £ AC{v) such that pi{v) < xi < si{v) and there is no e G AC{v) with 
Pi{v) 1^ e 1^ si{v). We also identify Phiv) G AC{v) and Sh{v) G AC{v) such that Ph{v) < x^ < Sh{v) 
and there is no e G AC(v) with Ph{v) < e < Sh{v). For any e G AC{v), e G [xi,x/i] if and only if 
si{v) < e <Ph{v). 

We set pi{vo) = ^pred^xi, AC{vo)) and si{vq) = sncc{x i, AC (vq)). Given pi{v) and si{v) for some 
node v G TT, we identify the up-pointers bi{v) = Tpred{pi{v), UP{v)) and b2{v) = succ{si{v), UP{v)). 
Up-pointers bi{v) and 62(1') satisfy Property [2j Hence for the parent w of f, the catalog AC{w) 
contains at most r elements between bi{v) and 62 (w). We can search for an element e' G AC{w)^ 
bi{v) < e' < b2{v) using the data structure R{w). If such e' does not exist, we set pi{w) = 61(f) 
and si{w) = b2{v). Otherwise we examine 0{d) neighbors of e' in AC{w) and find pi{w) and si{w). 
We can identify Ph{v) and Sh{v) for all nodes w G vr in the same way. 
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Since C{v) C AC{v), any element e G C{v) belongs to the interval [x^jX/j] if and only if 
si{v) < e < Phiv). If C{v) n [si{v),ph{v)] / 0, we can find some e^ G C{v) D [si{v),ph{v)] using 
the data structure Rc{v). Then, we examine elements that follow Cm in C{v) until an element 
eh G C{v), Ch > Xh, is found. We also examine elements that precede em in C{v) until an element 
ei £ C{v), 6/ < xi is found. Thus we can report all elements in C{v) n [x^, x/j] in 0{\C{v) n [x^, x/jJI) 
time if si{v) and Ph{v) are known. Update time and space usage are the same as in the catalog 
searching data structure. 

4 Multiple Catalog Maximum Queries 

In this section we describe a simple data structure that enables us to identify the maximum element 
in each catalog C{v) for every node u G tt on a query path vr. Again, tt is a path from a node vq 
to a node vi such that vi is the ancestor of vq. In this section we assume that the maximum node 
degree of a node is d = 0(log ' n). 

Moreover, we can support extended update operations. An operation minsert(e, /, /,f) inserts 
an element e into catalogs C{vf),C{vf+i), . . . ,C{vi), where Vf,. . . ,vi are children of some node 
V. In this case we say that an element is associated with an interval [/, I] in the node v. We 
assume that each element is associated with at most one interval in every node v of T. An opera- 
tion mdelete(e, ?;) deletes an element e from all catalogs C{vf),C{vf^i), . . . ,C{vi), such that e is 
associated with an interval [/, I] in the node v G T. 

Theorem 3 We are given a tree T with maximal node degree d = log ' n, so that a catalog 
C{v) d U is associated with each node v, 'Ylv&T\^i'^)\ ~ ^- There exists a data structure that 
answers multiple catalog maxima queries in 0{t{n) + |7r|) time, where t{n) denotes the time needed 
to search in one catalog of n elements. If a pointer to (the predecessor of) x in L)AC{vi) is given, 
then minsert(x, /, /, v) and mdelete(x,ij) are supported in O(loglogn) time and 0((loglogn)^) 
time respectively. 

All elements from a catalog C{vi) are stored in a data structure D{v) for a parent v of Vi. Each 
element e in D{v) is associated with an interval [f,l], f < I, such that e is stored in all catalogs 
C{vf), . . . ,C{vi). We implement D{v) using the generalized union-split-find data structure de- 
scribed in Theorem 5.2 of [18J. This enables us to support the following operations: we can insert 
a new element e associated with an interval [cf, e/] into D{v) in O(loglogTi) time if the position of 
e in UC(fj) is known. We can delete an element e from D{v) in O(loglogn) time. For any interval 
[xi,X2], I < xi < X2 < d, and any q we can find the largest element e G D(v) such that e < q and 
[ef,ei] n [xi,X2] / 0. 

We store a local tree T(v) in every node v. Leaves of T{v) correspond to children of v; T{v) is 
a binary tree of height 0(log log n). We say that an element e covers a node u of T{v) if e is stored 
in all catalogs C{vi) for all children Vi of v. We say that e belongs to a node u of T{v) if e covers 
u but e does not cover the parent of u. The set Fu{v) contains all elements that belong to a node 
u of T{v). The data structure M(v) contains maximal elements from every set Fu{v). 

We can use the fact that M{v) contains O(logn) elements and implement it in one machine 
word, so that for any path 7r(f;) in T{v) the maximum element e G ^ue-wMFuiv) can be found in 
constant time. Updates of M{v) are also supported in constant time. M{v) is implemented as 
follows. Let max„ denote the maximum element in Fu{v). The word W{v) contains the rank of 
max„ in M{v) for every node u of T{v) (nodes of T{v) are stored in pre-order). Since ranks of all 
maxu fit into one machine word, we can modify the ranks of all elements in M{v) in Oil) time 
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when niax„ for some node u of T{v) is changed. Using table look-ups and bit operations on W{v), 
we can also find the maximum in ^uGirM^uiv) for any path 7r(i;) in T{v). 

Using data structures M{w) for the parent w of v, we can find the maximum element in C{v) 
for any node v in constant time. We can identify the maximum element in a catalog C{v) by finding 
the maximum element among max„ for u G 7r(v,w). Here Tr{v,w) denotes the path in T{w) from 
the leaf that corresponds to v to the root of T{v). Hence, we can find the maximum element in 
each C{v) in 0(1) time using M{v). 

When a new element e is inserted into catalogs C{vf), C{vf+i), . . . , C{vi), we insert e into the 
data structure D{v), where v is the parent node of Vf,...,vi. We can find O(loglogn) nodes 
til, . . .Us in T{v), such that e belongs to each Uj, I < j < s. For every Uj, if e > max„^. then we 
update the data structure M{v). Hence, an operation m±iisert{e, f,l,v) takes O(loglogn) time. 
When an element e is deleted from catalogs C{vf),C{vf^i), . . . , C{vi), we check whether e is stored 
as a maximum element max„ for some nodes u in M{v). For every such u, we find the largest 
element Cu < e such that e^ G Fu{v). Using the data structure D{v), we can find e^ in O(loglogn) 
time. When e^ is found, we update M{v) accordingly in 0(1) time. Finally, we delete e from D{v) 
in O(loglogn) time. Hence, nidelete(e,t;) takes 0((loglogn)^) time. 

5 Applications 

Applications in which we associate ordered sets with each node of a balanced tree are a frequent 
topic in data structures. In many cases we want to search in all catalogs that are associated with 
nodes on a specified root-to-leaf path. Since a root-to-leaf path in a balanced tree consists of 
O(logn) nodes and t(n) = O(logn), where t{n) is the time we need to search in one catalog of 
n elements, Theorem [1] enables us to spend 0(1) time in each catalog. If the node degree of a 
balanced tree is 0(log'^n) for a constant c, then a root-to-leaf path consists of 0(logn/loglogn) 
nodes. Using fusion trees \15\ 116]. we can search in a single catalog in t{n) = 0(logn/loglogn) 
time. Hence, Theorems [T] and [2] enable us to spend 0(1) time in each catalog even in the case 
when the node degree is poly-logarithmic. Below we will sketch how our techniques can be used to 
obtain dynamic data structures for several important problems. 

Point Location in a Horizontal Subdivision. In this problem the set of n horizontal segments 
is stored in the data structure, so that for a query point q = {qx,Qy) the segment immediately 
below (or immediately above) q can be reported. As in [18] and several other point location data 
structures [71[5], our solution is based on segment trees. The leaves of a segment tree correspond 
to x-coordinates of segment endpoints. The range rng{v) of a node v is an interval [pi,Pr] where 
Pi is the x-coordinate stored in the leftmost leaf descendant of v and pr is the x-coordinate stored 
in the rightmost leaf descendant of v. We denote by proj{s) the projection of a segment s on 
the X-axis. A set S{v) is associated with each node v; S{v) contains all segments s such that 
rng(v) is contained in proj{s) but rn(7(parent(z;)) is not contained in proj{s). Each internal 
node in our segment tree has 0(log n) children for 5 = e/2. Hence, each segment belongs to 
0(log ^ n) sets S{v). If a g^; G proj{s) for some segment s, then s is stored in one of sets S{v), 
where v G n and vr is the path from the leaf that contains the successor of qx to the root of the 
segment tree. We store the y-coordinates of segments from S{v) in a catalog C{v). Hence finding 
a segment below (above) q = {qx-,%) is equivalent to searching for the predecessor (successor) 
of qy in Ut,g7rC(f). We apply the multiple catalog searching technique to catalogs C{v), so that 
we can search in U^,g7rC(^;) in 0(logn/loglogn) time and update a catalog C{v) in 0(log n) 
time. When a new segment S is inserted into the data structure, we insert the y-coordinate of 
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s into 0(log^^ n) catalogs C{vi), . . . , C{vm)- Using the standard fractional cascading technique, 
we can identify position of the y-coordinate ys of s in augmented catalogs AC{vi), . . . , AC{vm) in 
0(log "*" log log n) time. Then we can insert ys into the multiple catalog searching data structure 
in 0(log nlog "*" n) = 0(log ^^ n) by Theorem [TJ Deletions are supported in the same way. The 
data structure uses 0(n log^"*"^ n) space. But we can reduce the space usage to linear by using the 
technique described in [18] and the technique of [7] . Details will be given in the full version of this 
paper. 

Stabbing-Max Data Structure. We use the same construction as in the point location data 
structure, but catalogs C{v) contain priorities of segments stored in S{v). For this problem, we 
use Theorem [3l To find the segment s with the highest priority such that x G s, we identify 
the maximum element in catalogs Uv^nCiv)- Suppose that a new segment s is inserted. All 
nodes v, such that s is stored in S{v) can be divided into 0(logn/loglogn) groups. The i-th group 
consists of sibling nodes Uij-, . . . , Ui^i- that have the same parent node Ui. Using standard fractional 
cascading, we can identify the position of s in all data structures D{ui) in O(logn) time. Then, 
we can use Theorem [3] to insert the segment s into D{ui) and to update M{ui) in O(loglogn) 
time for each Uj. Hence, the total time for an insertion is O(logn). Deletions are performed in 
a symmetric way, but we need 0((loglogn)^) time to update M(mj). Hence, the total time for a 
deletion is 0(log n log log n). We observe that it is not necessary to store catalogs C{y) and set S{y) 
in every node v. We only need to store the data structures D{v) and M{v) described in the proof of 
Theorem[3l Hence, the total space used by our construction is 0(n log n/ log log n). Thus we obtain 
aO{n log n/ log log n) space data structure that answers stabbing-max queries in 0(log n/ log log n) 
time, supports insertions in O(logn) time, and supports deletions in 0(log nlog log n) time. 

Alternatively, we can store C{v) using Theorem [TJ To identify the highest priority segment 
that contains a query point g, we search for the predecessor of pd in Uy^-,^C{v)] here pd denotes 
the dummy priority such that pd is larger than priority of any segment in the data structure. In 
this case update time and space usage can be estimated as for the horizontal point location data 
structure. Thus we obtain a 0{n) space data structure that supports queries in 0(logn/loglogn) 
time and updates in 0(log "''^n) time. 

Line-Segment Intersection. Again, we use the same construction as in the point location data 
structure. But now we use Theorem [21 so that multiple catalog reporting queries can be answered. 
Given a vertical segment Sq with endpoints {xg,yi) and {xg,y2), each segment that intersect Sg 
belongs to some set S{v) for a node v € tt, where vr is the path from the node that contains the 
successor of Xg to the root of the segment tree. A segment s £ S{v), v G n, intersects with Sg if 
and only if the y-coordinate of s belongs to the range [yi, ^2]- Hence, we can find all segments that 
intersect Sg by answering a multiple catalog reporting query. As in the previous case the update 
time is 0{log^~^^ n). We need 0(n log^"*"^ n) space to store all segments. But we can reduce the 
space usage to 0(n log n/ log log n) by using the technique similar to the compact representation 
described in [9]. Details will be given in the full version. 
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